Model Evaluation, Leaderboards, Capability Assessment, AI Competition
AI is getting out of its limits 🤯
threadreaderapp.com·10h
Confidence in AI Results (CAIR)
blog.langchain.com·10h
What I Learned About AI at the World’s Most Famous Museum
kill-the-newsletter.com·4h
From Messy Shelves to Master Librarians: Toy-Model Exploration of Block-Diagonal Geometry in LM Activations
lesswrong.com·20h
Loading...Loading more...